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RUN-TIME CALL STACK VERIFICATION 



FIELD OF THE INVENTION 

[0001] The present invention relates generally to the field of computing, and, more 
particularly, to the verification of a call stack. 

BACKGROUND OF THE INVENTION 

[0002] There are certain applications of computing in which functions performed by the 
computer are sensitive and need to be protected from abuse and attack. For example, in a digital 
rights management (DRM) system that protects valuable (e.g., copyrighted) content from being used 
in an unlicensed manner, the content is typically stored in an encrypted form, and is decrypted only 
under the circumstances specified in an electronic license. Thus, the code that applies the decryption 
key to generate clear content is a sensitive piece of code, in the sense that such code should not 
divulge the decryption key, and should provide clear content only under circumstances that provide 
reasonable assurance that the clear content will not be misused or distributed in a manner that is 
contrary to license. The piece of code that performs the decryption function or any sensitive 
operations is, in some contexts, referred to as a "black box." Such a black box should resist attempts 
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by attackers to manipulate or misuse the black box binary or application which has the rights to the 
clear content in a manner that would divulge the key or the clear content. 

[0003] A black box (or other program module that performs a sensitive function) should 
implement tamper-resistant measures that are designed to thwart known types of attack on the black 
box or the application which leverages its functionality. Typical tamper-resistance measures may 
include code obfiiscation, code encryption, self-modification of code, and other known techniques 
that tend to confound analysis and manipulation of code. 

[0004] While a computer program may employ tamper-resistant measures of the type 
described above, one security weakness not easily addressed by such techniques arises at the 
boundary between two code modules. Modem programs are typically modularized into components 
that communicate with each other by way of an interface. For example, the black box described 
above can be implemented as a dynamic-link library (DLL), which can be linked to, and called by, 
one or more application programs. The advantage to such a design is that the black box's function 
can be used ("consumed") by many programs, without that function having to be implemented 
explicitly by each program, and without the inner workings of the black box having to be explicitly 
disclosed to the various third party vendors who may write the consuming programs. It also allows 
updates to the black box software to be made, for example to fix bugs or add additional tamper- 
resistance, without needing to recompile and reship all of the application programs which rely on its 
function. The disadvantage to such a design from a security standpoint, however, is that 
modularization generally facilitates and encourages the interoperation of code modules (e.g., in 
theory, any application can link to and call a DLL), while the security of the black box may depend 
on restricting the circumstances under which the black box can be called (e.g., the black box should 
not permit its decryption function to be called by applications that might disseminate valuable 
content in the clear). 

[0005] Standard tamper-resistance techniques are focused on setting up a boundary around 
a code module and protecting that boundary from breach. However, since a DLL necessarily shares 
a permeable interface with its calling application, the standard tamper-resistance techniques do not 
address the need for protection in terms of how, and under what circumstances, this interface is used 
to permit access to the DLL's functionality. 
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[0006] In view of the foregoing, there is a need for a system that overcomes the drawbacks 
of the prior art. 

SUMMARY OF THE INVENTION 

[0007] The present invention helps to prevent a code module from being used in an 
unauthorized manner, or under inappropriate circumstances. The invention operates by verifying the 
call stack in order to ensure that that the code module has been called under legitimate 
circumstances. 

[0008] When a function is called, the caller leaves a "return address" on the program stack. 
The return address is the address of the instruction to which the program counter is to be pointed 
after the called function completes execution. Since this instruction is generally the instruction that 
immediately follows the call instruction, the return address effectively identifies where the called 
function was called from. Since the return address is located inside the calling application, the return 
address can be used to identify the application that made the call. If the called code module 
maintains a record of legitimate sources that are permitted to call the code module, the return 
address effectively allows the called code module to distinguish legitimate calls from illegitimate 
ones. Thus, a code module may permit itself to be called from certain addresses within certain 
applications, and may otherwise reject attempts to be called by other sources. In the case, for 
example, of the black box that performs decryption, an illegitimate call may indicate that a rogue 
application is attempting to use the black box to perform the decryption function for nefarious 
purposes, and upon detecting such a misuse the black box can take appropriate action. 

[0009] Additional security may be provided by verifying not only that the return address is 
within the calling application, but also that the calling application has not been modified relative to 
a known state. Thus, a checksum or hash of the calling application (or of some subset of pages of 
the calling application) may be taken and stored; then, when the program module detects that it has 
been called by the calling application, this checksum or hash may be recomputed and compared 
with the stored checksum or hash to verify that the calling application has not been modified. 

[0010] Since there are certain classes of attack that take advantage of the standard calling 
convention that is used to invoke the functions of a program module (e.g., by adjusting the return 
address on the stack so that the called function will return to an unexpected location), it is preferable 
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that the program module to be protected not be directly exposed to the calling application. In a 
preferred embodiment, the calling application calls an intermediate module which, in turn, uses a 
non-standard calling convention to call or jump into the protected module. The non-standard calling 
convention is preferably designed to preserve the return address that was in effect at the time that 
the calling application made the call (so that the return address is available to, and can be verified 
by, the protected code module itself), while also being designed to cause an operating failure if the 
return address on the call stack is modified. 

[0011] Other features of the invention are described below. 

BRIEF DESCRIPTION OF THE DRAWINGS 

[0012] The foregoing summary, as well as the following detailed description of preferred 
embodiments, is better understood when read in conjunction with the appended drawings. For the 
purpose of illustrating the invention, there is shown in the drawings example constructions of the 
invention; however, the invention is not limited to the specific methods and instrumentalities 
disclosed. In the drawings: 

[0013] FIG. 1 is a block diagram of an example computing environment in which aspects 
of the invention may be implemented; 

[0014] FIG. 2 is a block diagram showing the use of a dynamic-link library by a computer- 
program; 

[0015] FIG. 3 is a block diagram of a calling function and its return address, and the 
inference of the caller's identity from the return address; 

[0016] FIG. 4 is a block diagram of a calling function and a stored checksum that can be 
used to verify non-modification of the calling function or a portion thereof; 

[0017] FIG. 5 is a block diagram of a computer program that calls a first dynamic-link 
library through a second dynamic-link library; 

[0018] FIG. 6 is a diagram showing a flow of execution when a method in a dynamic-link 
library is called in accordance with aspects of the invention; and 

[0019] FIG. 7 is a flow diagram of an example call-stack verification process in 
accordance with aspects of the invention. 
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DETAILED DESCRIPTION OF ILLUSTRATIVE EMBODIMENTS 

Overview 

[0020] There are some computer security models that depend on protecting a given code 
module from tampering or misuse. Many security technologies focus on detecting and preventing 
code tampering in order to prevent hackers from modifying a code module to perform an unintended 
function. However, it is also desirable to ensure that an unmodified code module is called only 
under legitimate circumstances. The present invention provides mechanisms that assist a code 
module in verifying the circumstances under which it has been invoked. 

Exemplary Computing Arrangement 

[0021] FIG. 1 shows an exemplary computing environment in which aspects of the 
invention may be implemented. The computing system environment 100 is only one example of a 
suitable computing environment and is not intended to suggest any limitation as to the scope of use 
or functionality of the invention. Neither should the computing environment 100 be interpreted as 
having any dependency or requirement relating to any one or combination of components illustrated 
in the exemplary operating environment 100. 

[0022] The invention is operational with numerous other general purpose or special 
purpose computing system environments or configurations. Examples of well known computing 
systems, environments, and/or configurations that may be suitable for use with the invention 
include, but are not limited to, personal computers, server computers, hand-held or laptop devices, 
multiprocessor systems, microprocessor-based systems, set top boxes, programmable consumer 
electronics, network PCs, minicomputers, mainframe computers, embedded systems, distributed 
computing environments that include any of the above systems or devices, and the like. 

[0023] The invention may be described in the general context of computer-executable 
instructions, such as program modules, being executed by a computer. Generally, program modules 
include routines, programs, objects, components, data structures, etc. that perform particular tasks or 
implement particular abstract data types. The invention may also be practiced in distributed 
computing environments where tasks are performed by remote processing devices that are linked 
through a communications network or other data transmission medium. In a distributed computing 
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environment, program modules and other data may be located in both local and remote computer 
storage media including memory storage devices. 

[0024] With reference to FIG. 1, an exemplary system for implementing the invention 
includes a general purpose computing device in the form of a computer 110. Components of 
computer 110 may include, but are not limited to, a processing unit 120, a system memory 130, and 
a system bus 121 that couples various system components including the system memory to the 
processing unit 120. The processing unit 120 may represent multiple logical processing units such 
as those supported on a multi-threaded processor. The system bus 121 may be any of several types 
of bus structures including a memory bus or memory controller, a peripheral bus, and a local bus 
using any of a variety of bus architectures. By way of example, and not limitation, such 
architectures include Industry Standard Architecture (ISA) bus, Micro Channel Architecture (MCA) 
bus, Enhanced ISA (EISA) bus, Video Electronics Standards Association (VESA) local bus, and 
Peripheral Component Interconnect (PCI) bus (also known as Mezzanine bus). The system bus 121 
may also be implemented as a point-to-point connection, switching fabric, or the like, among the 
communicating devices. 

[0025] Computer 110 typically includes a variety of computer readable media. Computer 
readable media can be any available media that can be accessed by computer 110 and includes both 
volatile and nonvolatile media, removable and non-removable media. By way of example, and not 
limitation, computer readable media may comprise computer storage media and communication 
media. Computer storage media includes both volatile and nonvolatile, removable and non- 
removable media implemented in any method or technology for storage of information such as 
computer readable instructions, data structures, program modules or other data. Computer storage 
media includes, but is not limited to, RAM, ROM, EEPROM, flash memory or other memory 
technology, CDROM, digital versatile disks (DVD) or other optical disk storage, magnetic 
cassettes, magnetic tape, magnetic disk storage or other magnetic storage devices, or any other 
medium which can be used to store the desired information and which can accessed by computer 
110. Communication media typically embodies computer readable instructions, data structures, 
program modules or other data in a modulated data signal such as a carrier wave or other transport 
mechanism and includes any information delivery media. The term "modulated data signal" means 
a signal that has one or more of its characteristics set or changed in such a manner as to encode 
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information in the signal. By way of example, and not limitation, communication media includes 
wired media such as a wired network or direct-wired connection, and wireless media such as 
acoustic, RF, infrared and other wireless media. Combinations of any of the above should also be 
included within the scope of computer readable media. 

[0026] The system memory 130 includes computer storage media in the form of volatile 
and/or nonvolatile memory such as read only memory (ROM) 131 and random access memory 
(RAM) 132. A basic input/output system 133 (BIOS), containing the basic routines that help to 
transfer information between elements within computer 110, such as during start-up, is typically 
stored in ROM 131. RAM 132 typically contains data and/or program modules that are immediately 
accessible to and/or presently being operated on by processing unit 120. By way of example, and 
not limitation, FIG. 1 illustrates operating system 134, application programs 135, other program 
modules 136, and program data 137. 

[0027] The computer 110 may also include other removable/non-removable, 
volatile/nonvolatile computer storage media. By way of example only, FIG. 1 illustrates a hard disk 
drive 140 that reads from or writes to non-removable, nonvolatile magnetic media, a magnetic disk 
drive 151 that reads from or writes to a removable, nonvolatile magnetic disk 152, and an optical 
disk drive 155 that reads from or writes to a removable, nonvolatile optical disk 156, such as a CD 
ROM or other optical media. Other removable/non-removable, volatile/nonvolatile computer 
storage media that can be used in the exemplary operating environment include, but are not limited 
to, magnetic tape cassettes, flash memory cards, digital versatile disks, digital video tape, solid state 
RAM, solid state ROM, and the like. The hard disk drive 141 is typically connected to the system 
bus 121 through a non-removable memory interface such as interface 140, and magnetic disk drive 
151 and optical disk drive 155 are typically connected to the system bus 121 by a removable 
memory interface, such as interface 150. 

[0028] The drives and their associated computer storage media discussed above and 
illustrated in FIG. 1, provide storage of computer readable instructions, data structures, program 
modules and other data for the computer 1 10. In FIG. 1, for example, hard disk drive 141 is 
illustrated as storing operating system 144, application programs 145, other program modules 146, 
and program data 147. Note that these components can either be the same as or different from 
operating system 134, application programs 135, other program modules 136, and program data 
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137. Operating system 144, application programs 145, other program modules 146, and program 
data 147 are given different numbers here to illustrate that, at a minimum, they are different copies. 
A user may enter commands and information into the computer 20 through input devices such as a 
keyboard 162 and pointing device 161, commonly referred to as a mouse, trackball or touch pad. 
Other input devices (not shown) may include a microphone, joystick, game pad, satellite dish, 
scanner, or the like. These and other input devices are often connected to the processing unit 120 
through a user input interface 160 that is coupled to the system bus, but may be connected by other 
interface and bus structures, such as a parallel port, game port or a universal serial bus (USB). A 
monitor 191 or other type of display device is also connected to the system bus 121 via an interface, 
such as a video interface 190. In addition to the monitor, computers may also include other 
peripheral output devices such as speakers 197 and printer 196, which may be connected through an 
output peripheral interface 195. 

[0029] The computer 110 may operate in a networked environment using logical 
connections to one or more remote computers, such as a remote computer 1 80. The remote 
computer 180 may be a personal computer, a server, a router, a network PC, a peer device or other 
common network node, and typically includes many or all of the elements described above relative 
to the computer 110, although only a memory storage device 181 has been illustrated in FIG. 1. The 
logical connections depicted in FIG. 1 include a local area network (LAN) 171 and a wide area 
network (WAN) 173, but may also include other networks. Such networking environments are 
commonplace in offices, enterprise-wide computer networks, intranets and the Internet. 

[0030] When used in a LAN networking environment, the computer 1 10 is connected to 
the LAN 171 through a network interface or adapter 170. When used in a WAN networking 
environment, the computer 110 typically includes a modem 172 or other means for establishing 
communications over the WAN 173, such as the Internet. The modem 172, which may be internal 
or external, may be connected to the system bus 121 via the user input interface 160, or other 
appropriate mechanism. In a networked environment, program modules depicted relative to the 
computer 1 10, or portions thereof, may be stored in the remote memory storage device. By way of 
example, and not limitation, FIG. 1 illustrates remote application programs 185 as residing on 
memory device 181. It will be appreciated that the network connections shown are exemplary and 
other means of establishing a communications link between the computers may be used. 
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Calling of a dynamic-link library by a program module 

[0031] A computer program typically makes use of modules that have a separate and 
independent existence from the computer program itself. For example, when an application program 

135 (shown in FIG. 1) executes, that application program may make use of a program module 136 
(also shown in FIG. 1) during that execution. FIG. 2 shows how a computer program may make use 
of a separate program module. 

[0032] In FIG. 2, application program 135 comprises a series of instructions that are 
located at some known place in memory. During the execution of application program 135, an 
instruction 200 is encountered to call a method that is not located within the boundaries of 
application program 135. ("Not located within the boundaries/' in this context, means that the called 
method is located outside of the portion of memory that is allocated to application program 135.) In 
the example of FIG. 2, instruction 200 causes a method in program module 136 to be called. 

[0033] Program module 136 may, for example, be a dynamic-link library ("DLL") - i.e., a 
file containing a set of executable methods (or functions) that can be loaded into memory and 
executed by a program such as application program 135. In the example of FIG. 2, program module 

136 contains methods 1 through n, and the call 202 issued by application program 135 invokes the 
second method. When call 202 is made, the processor on which application program 135 is 
executing begins to execute method 2. Method 2 will likely contain a "return" instruction; when the 
return instruction is encountered, the processor will return (204) to application program 135 and will 
resume executing application program 135 at the instruction contained at address 206. Since address 
206 is where execution resumes after the "return" instruction is encountered, address 206 is referred 
to as a "return address." As is known in the art, standard calling conventions generally store the 
return address on the stack at the time of the call. 

[0034] It will be understood that FIG. 2 shows application program 135 as the computer 
program that makes use of a separate program module; however, any type of computer program 
may use a separate program module, and an "application" is merely one example of such a 
computer program. In general, the environment in which a computer program executes may permit 
one program module (of which an "application program" is just one example) to make use of 
another program module. 
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Use of a return address to identify a caller 

[0035] Typically, return address 206 is the address of the memory location that 
immediately follows the call instruction. Since return address 206 is part of the computer program 
(or other program module) that issued the call instruction, the identity of return address 206 
essentially identifies the calling entity. Since one DLL can be called by different entities, knowing 
the return address allows identification of the entity that issued the last call instruction that was 
issued prior to entry of the DLL when using standard calling conventions and an import address 
table. 

[0036] FIG. 3 shows two applications programs 135(1) and 135(2) that exist on the same 
computer as program module 136 (which may be, for example, a DLL). Application program 135(2) 
calls program module 136, and the return address of the call is the address of memory location 
206(2). (As is known in the art, the return address is typically stored at an identifiable location on 
the program stack.) Since memory location 206(2) is within application program 135(2), it can be 
inferred from examining the return address that the last call instruction was issued by application 
program 135(2). If the return address had been the address of memory location 206(1), then it could 
be inferred that the last call instruction was issued by application program 135(1), since memory 
location 206(1) is located within application program 135(1). Thus, FIG. 3 shows how the return 
address may imply the identity of the caller. 

[0037] Moreover, the return address may imply not merely the identity of the calling 
program, but also a specific location within the calling program. Since the return address is typically 
the address that immediately follows the call instruction, knowing that the return address is the 
address of memory location 206(2) may imply that program module 136 was called by a particular 
function (or even a particular line of code) within application program 136(2). 

[0038] Being able to identify the caller may have useful security implications. For 
example, program module 136 may be cryptographic software that hides a decryption key and uses 
the key to decrypt valuable content. The security model in which such cryptographic software 
participates may be such that program module 136 should only perform the decryption function for 
callers of verifiable integrity. For example, application program 135(2) may be an application that 
has been certified as being capable of protecting valuable content from illicit copying, whereas 
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application 135(1) may be an unknown application (or an application that is known to exhibit rogue 
behavior with respect to valuable content). Thus, application 135(2) should be allowed to call 
program module 136 to perform the decryption function, while application 135(1) should not be 
allowed to call program module 136. Thus, it may be useful for a called program module 136 to be 
able to identify its caller, and the return address is one mechanism by which the caller can be 
identified. 

Integrity of Identified Caller 

[0039] As described above in connection with FIG. 3, a return address can be used to 
identify a caller. As further described above, such identification of the caller has security 
implications, since there are security models that depend on certain functions being callable only by 
known programs whose behavior is understood. In this sense, identifying the caller provides a 
measure of security to the extent that an identifiable caller will exhibit known behavior. However, 
this type of security can be thwarted if the "known" caller can be modified to behave in some 
unexpected manner. For example, when security is based on requiring a function to be called only 
by a certified calling application, a hacker may modify the certified calling application to exhibit 
nefarious behavior. 

[0040] Such an attack on the security of the system can be thwarted by verifying that the 
caller's code (or at least a portion of the caller's code) has not been modified relative to some 
known state for that code. Thus, it is desirable to store a checksum, hash, or similar datum that 
represents the calling code in a known state. For example, in FIG. 4 application 135 is the calling 
application. Application 135 makes a call to an external program module 136, with a return address 
206. Program module 136 regards application 135 as a legitimate caller, and regards calls with 
return address 206 as coming from a legitimate place within application 135. Thus, at some point 
prior to the call, a checksum 404 is derived from application 135, and this checksum is stored. 
Checksum 404 may be derived from the entirety of application 135, or from the page(s) that 
immediately surround return address 206 (reference numeral 402). At the time that program module 
136 is called from 206, checksum 404 can be recomputed based on application 135 (or pages 402) 
and then compared with the stored checksum. If the re-computed and stored checksums match, it 
can be inferred that application 135 (or the relevant pages 402 of application 135) have not been 
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modified relative to a known previous state. Example mechanisms for verifying that a caller has not 
been modified are more particularly discussed below in the section entitled "module 
authentication," and in subsequent sections. 

[0041] As noted above, in one security model program module 136 performs a sensitive 
function (e.g., the decryption of valuable information), and should only perform this function for a 
caller whose behavior is known and trusted (e.g., an application that will render the valuable content 
in an ephemeral manner, but that will not allow the content to be disseminated in the clear to other 
users). Thus, the caller's return address allows the caller to be identified so it can be determined 
whether the caller is a known and trusted caller; additionally, verifying checksum 404 allows it to be 
determined that the caller's code (and, therefore, its behavior) remains in the state that the code was 
in at the time the checksum was created (at which time the caller's behavior, presumably, was 
believed to be trustworthy). 

Indirect calling of program module 

[0042] When a program module needs to be protected from misuse or tampering, security 
benefits can be derived from exerting tight control on the manner in which the program module can 
be entered. One way to achieve this control is to prevent the program module from being called 
directly by applications. Instead, an intermediate module can be exposed to applications, and all 
calls to the protected program module may be made by the intermediate module on behalf of the 
application that calls the intermediate module. This structure ensures that the protected program 
module is always called by a known entity. Moreover, as more particularly discussed below, the 
known entity can use a special calling convention to enter the protected program module, and this 
calling convention can be designed to provide additional security. 

[0043] Thus, in the example of FIG. 5, program module 136 is a dynamic-link library 
(DLL) that performs sensitive functions that should be performed under controlled circumstances 
(e.g., decryption of valuable information). Application 135 (depicted, by way of example, as a "3rd 
party client application") is a software application that calls on program module 136 to perform 
such functions. (Program module 136, in the example of FIG. 5, is a "black box," which is 
discussed above; additionally, in the example of FIG. 5, the black box is implemented as a DLL 
named "secrep.dll".) Application 135, however, does not call program module 136 directly, but 
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rather calls program module 136 through an intermediate module 502. In the example of FIG. 5, 
intermediate module 502 is a DLL named "msdrm.dll," which exposes an application programming 
interface (API) that provides DRM-related functions to applications. Program module 502 includes 
functionality called "black box stubs," which are the code segments that are callable by application 
135 and are used by intermediate module 502 to call into the black box. Preferably, the stubs are 
carefully contrived hand-coded assembly language stubs that perform a unique calling convention 
into the black box; such a calling convention is more particularly described below. (It should be 
understood that the references to DRM functionality and to the black box are merely exemplary, and 
the structure of FIG. 5 is not limited to DRM systems.) 

[0044] Thus, program module 136 is not exposed directly to application 135. There are 
certain classes of attacks that allow program module 136 to be misused by application program 135; 
these attacks are based on application program 135's having a direct interface to program module 
136, so requiring application program 135 to call program module 136 indirectly helps to guard 
against this class of attack. Preferably, intermediate module 502 uses a non-standard calling 
convention to call program module 136. An example of such a non-standard calling convention is 
more particularly discussed below. 

Calling Convention 

[0045] As discussed above, using a non-standard convention may be part of an overall 
security framework. A non-standard calling convention may thwart certain types of attacks that are 
based on the protocol by which function calls occur. Additionally, the particular non-standard 
calling convention described below has the advantage that it can preserve a caller's return address 
across more than one call boundary (i.e., if function 1 calls function 2, which calls function 3 using 
the non-standard calling convention described below, then function l's return address can be known 
to function 3; under a standard calling convention, function l's return address would be generally 
unavailable to function 3). 

[0046] In the example of FIG. 5, program module 502 is exposed to application 135, but 
program module 136 is not exposed to application 135. However, program module 502 contains 
"stubs" that can be used to enter program module 136 - i.e., application 135 invokes the stubs, and 
this invocation identifies the particular functionality from program module 136 that application 135 
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would like to perform. The stubs, in turn, invoke the relevant code in program module 136. Program 
module 136, in this example, includes the code need to verify and authenticate the caller (i.e., 
application 135), and the stubs contain data required by the verification and authentication code. 
Preferably, this data is mixed into the stubs' instruction streams. The call into program module 136 
passes the address of this data to the verification and authentication code "surreptitiously." That is, 
data used by the verification and authentication code is inserted into the instruction stream 
immediately after the call to program module 136; since the call causes the address immediately 
following the call instruction to be pushed onto the stack as a return address, the stack contains a 
pointer to the data in what would be the return address using a standard calling convention. If a 
standard tool were to patch the stub, then the return address will no longer point to the actual data 
that is needed by the verification function. The address of the stub itself is also preserved and the 
entire stub is verified by the code that ultimately executes the call (or jump) into the code providing 
the requested functionality, for example a content decryption request, in program module 136. 

[0047] On the x86 processor architecture, the DRMxxxStub therefore looks like (in 
pseudo-assembly): 

push cArgs ; number of DRMxxx function parameters 

call dword ptr [g_pf nBBJump] ; call blackbox "demux" authenticator 
0x07 ; data injected in instruction stream 

[0048] The return address pushed on to the stack as a result of the w cal 1 " will actually 

point at the " 0x0 7 " data. The blackbox demux authenticator will not actually return to that 

location, but will simply use the data and return directly to the calling application instead. This 

prevents standard profilers and binary patching tools from adding logging or redirection code at this 

interface, and prevents debuggers from adding breakpoints. It should be noted that data such as 

" 0x0 7 " is different for each stub and will be interpreted as a random instruction opcode by 

disassemblers (in this case, "pop es"), which is also a benefit. Since most instructions are longer 

than one byte, the bogus opcode will cause further mistakes in the disassembly as parts of other 

stubs are interpreted as part of that instruction (i.e. the disassembler gets "out of alignment"). 

[0049] On an x86, the following is the call stack layout upon entering BBJump: 
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// 
// 
// 
// 
// 
// 
// 
// 
// 
// 
// 
// 
// 
// 
// 



DRMxxx Stub Return Address 
Parameter Depth (cArgs) 
Ret Addr App/msdrm. dll 
DRMxxx Parameter 1 



DRMxxx Parameter n 



App/msdrm. dll local stack 



esp 

esp+ 4h 
esp+ 8h 
esp+ 8h+4h 

esp+ 8h+nDepth 
esp+OCh+nDepth 



[0050] BBJump uses the DRMxxx Stub return address to verify the stub has not been 
modified and to access the embedded data, which is actually a vtable entry descriptor for the desired 
DRM service. This descriptor and the "Parameter Depth" are used by the demux to call the desired 
function, DRMxxxPub, inside the blackbox and to prepare the stack for return to the application (i.e. 

resetting the stack to adhere to stdcall calling convention). The vtable is a covered (i.e. 

encrypted) list of function pointers inside the binary, in random order (i.e. encrypted and ordered 
uniquely for each blackbox). 

[0051] After verification, the application return address is inserted in the stack below the 
DRMxxx parameters, so that BBJump will return directly to the application rather than the stub. The 
DRMxxx Stub Return Address is replaced on the stack with the address of the DRMxxxPub 
address, and the cArgs is replaced with a return address inside BBJump for cleanup. The following 
shows the stack after these replacements have been performed: 
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// I 1 

// | DRMxxxPub Address | esp 

// | I 

// | BBJump_Cleanup address | esp+ 4h 

// I I 

// | Ret Addr App/msdrm. dll | esp+ 8h 

// | I 

// | DRMxxx Parameter 1 | esp+ 8h+4h 

// I I 

//I ... I 

// | | 

// | DRMxxx Parameter n | esp+ 8h+4*cArgs 

// | I 

//• | Ret Addr App/msdrm. dll I esp+0Ch+4 *cArgs 

// i I 

// | App/msdrm. dll local stack | esp+10h+4 *cArgs 

//I I 



[0052] By doing this, the subsequent jump into the unique v-table uncovering code will 
transition into the DRMxxxPub function when it attempts to "return" to BBJump. That is, the 
address of the DRMxxxPub function is surreptitiously used by BBJump to force the blackbox to 
transition into the DRMxxxPub function without returning to BBJump and making another call. 
Since the BBJump Cleanup address has been inserted below that, DRMxxxPub will return directly 
to BBJump for cleanup, which includes obliterating sensitive data from the recently used parts of 
the stack. 

[0053] The code which performs the call stack verification expects that the DRM API was 
called through the application's Import Address Table (IAT), the default behavior when the linker 
links the binary to a DLL. The call site is verified to have one of the standard IAT transitions. In 
theory, the application should call directly to the address of the blackbox stub function which called 
into BBJump. However, the "tail-call optimization" prevents generally requiring this (i.e. the call 
might be to another part of the application which then jumps into the blackbox stub). ("Tail-call 
optimization" refers to the case where the last instruction in a first function is to call a second 
function. Since it is not necessary to be able to return to the first function to execute more 
instructions, a compiler can optimize the generated code by generating an instruction to jump into 
the second function - thereby saving the relatively high cost of performing a true function call to the 
second function. The return instruction in the second function will then cause a return not to the first 
function, but rather to the function that issued a call to the first function (or, more generally, to 
whatever context appears at the next level down the call stack). This optimization saves the expense 
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of one function call and one return, but the cost of this optimization, from the perspective of call- 
stack verification, is that the call stack cannot be used to verify that the second function was called 
by the first, since a jump into the second function is not reflected in the call stack.) Additional 
restrictions on how the application is built, however, could enable this verification. The following is 
the expected instruction stream for an I AT call: 



// I AT indirection (disp32 is an address in the I AT) : 

// ; call relative disp32 (i.e. function call within module) 

// Caller: call Thunk ; E8 EB 7E 01 00 

// call 00017EEBh ; E8 EB 7E 01 00 

// 

// ; jmp absolute indirect disp32 Mod=00 Op=100 R/M=101 

// Thunk: jmp dword ptr [disp32] ; FF 25 0C 11 CB 00 

// jmp dword ptr [OOCBllOCh] ; FF 25 0C 11 CB 00 

// or: 

// ; call absolute indirect disp32 Mod=00 Op=010 R/M=101 

// call dword ptr [disp32] ; FF 15 0C 11 CB 00 

// call dword ptr [OOCBllOCh] ; FF 15 0C 11 CB 00 

// 

// Note: The compiler uses the first convention by default across DLL 

// boundaries. However, if declspec (dllimport ) modifier is specified 

// with the function declaration (i.e. in the header for the API), the 

// compiler will resolve the function name to imp_xxx which yields 

// the second convention. 

[0054] The verification code also checks that the BBJump and blackbox stub code has not 



been tampered with. Here is the instruction stream it expects, respectively: 
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// 









// 


pbBB Jump [ 


-10] | 


8D 


// 









// 


pbBB Jump [ 


-9] | 


44 


// 









// 


pbBB Jump [ 


-8] I 


24 


// 









// 


pbBB Jump [ 


-7] 1 


FC 


// 









// 


pbBB Jump [ 


-6] | 


50 


// 









// 


pbBB Jump [ 


-5] I 


E8 


// 









// 


pbBBJump [ 


-4] I 


XX 


// 









// 






XX 


// 









// 






XX 


// 









// 






XX 


// 






— _ . 


// 


pbBB Jump [ 


0] 1 


85 


// 









// 


pbBBJump [ 


1] 1 


CO 


// 









// 
















//- 








II 








II 









II 


pbStubRA[ 


-8] I 


6A 


1 1 









II 


pbStubRA[ 


-7] 1 


XX 


II 









II 


pbStubRA[ 


-6] | 


FF 


II 









II 


pbStubRA[ 


-5] 1 


15 


II 









II 


pbStubRA[ 


-4] I 


XX 


II 









II 






XX 


II 








II 






XX 


II 








II 






XX 


II 








II 


pbStubRA[ 


0] 


XX 


II 








II 









lea eax, [esp-4h] 




-call BBCheckStack 



-test eax, eax 



I -push cArgs 




-call dword ptr [g_pf nBBJump] 
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// 

// 

// pbAppRA always points to the instruction following a call. However, the 
// call might have ultimately reached us: 

// 1) direct: call absolute indirect (reg and/or disp32) to DRMxxx 
// 2) chain: call relative to series of functions which jmp absolute 
// indirect, the last of which jmps to DRMxxx 

// I AT is a special case of 2), where the chain is only 1 deep. 2) can also 

// occur as a compiler optimization if a call to a function is the last 

// instruction of the caller function (and stack clean up is not an issue) . 

// 

[0055] FIG. 6 shows an example flow of execution when the calling process described 
above is used. Initially, client application 135 places a call to a black box stub 602, which may be 
part of the intermediate module 502 shown in FIG. 5. The black box stub then calls BBJump 604. 
At the time of the call to BBJump, the return address that was in effect at the time that the black box 
stub 602 was called is preserved so that it can later be used by the call stack verification routine 606. 
Additionally, as discussed above, data is injected into the stub instruction stream that is later used to 
identify the black box function that will be invoked after call stack verification. BBJump 604, in 
turn, calls the call stack verification routine 606. Upon return from the call stack verification routine 
606, BBJump 604 jumps into the v-table uncovering code 608 that identifies the black box function 
to be invoked; as noted above, the identity of the function to be invoked is based on the data 
injected into the stub instruction stream. The v-table uncovering code 608 then jumps into the 
function 610 that the application 135 desired to call. Upon return from this function, execution 
control is passed back to the client application 135 at the return address that was in effect at the time 
that black box stub 602 was called. It should be noted that there is preferably no return to the black 
box stub 602 itself; as described above, the call stack is preferably manipulated in a way that causes 
return to black box stub 602 (or to any portion of intermediate module 502) to be bypassed. 

Example process of verifying a call stack 

[0056] FIG. 7 shows an example process of verifying a call stack - i.e., verifying that a 
program module has been called under accepted circumstances. 

[0057] Initially, a call from an application program is received 702 at an intermediate 
module (e.g., intermediate module 502, shown in FIG. 5). The intermediate module uses a non- 
standard calling convention to call into (or jump to) 704 a program module. As discussed above, the 
non-standard calling convention resists certain classes of attacks involving the program module that 
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would be easily carried out if there were a conventional interface into the program module. The 
identity of the caller is then verified 706 based on the return address of the original call. The 
relevant return address is that which was specified when the initial call was made at 702; the non- 
standard calling convention is designed, among other things, to preserve this return address in some 
manner so that it can be verified from inside the program module that is called at 704. 

[0058] After the identity of the original caller has been identified based on the return 
address, the integrity of the caller is verified 706 based on a checksum - i.e., a checksum is 
computed based on the caller (or some portion thereof), and the computed checksum is compared 
with a stored checksum. Assuming that the caller's integrity is verified, the stack values are set 710 
so that the return address points back to the original caller (i.e., the caller who made the call to the 
intermediate module). A jump is then made 712 into the part of the program module that will 
perform the function requested (i.e., the function that the original caller desired to perform when 
making the call at 702), and, when this function is complete, a return instruction is then executed 
714 which pops the stack and returns to the return address that was placed on the stack at 710 
(which is the same as the return address specified by the original caller). 

Module Authentication 

[0059] As noted above, one feature provided herein is to authenticate the code that calls a 
program module - i.e., to verify that the code surrounding the call has not been modified relative to 
a known state. The following description relates to the module authentication procedure. 

[0060] Module authentication is a security feature that helps protect the application 
execution environment against tampering. When used properly, it provides some level of protection 
against code patching, redirection and software breakpoints. The level of protection is dependent on 
the number of modules in the applications process space that are authenticated and the frequency of 
the authentication. 

[0061] There are two phases to module authentication. Static authentication is a one-time 
check of module on-disk images while dynamic authentication is a check of the in-memory module 
images. Dynamic authentication is ongoing process since tampering can occur anytime in the life of 
the application process. 
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[0062] In practice, application performance and the openness of a Win32 process limit 
authentication to a subset of modules. The application specifies this subset of all its loaded modules 
via the application manifest. All modules that receive sensitive data should generally be 
authenticated. 

Static module authentication 

[0063] The purpose of static module authentication is to establish the authenticity of the 
on-disk images of important executable modules in the application process space. This includes 
both application and system modules. Static authentication does little to protect the execution 
environment directly, but it does serve an important purpose: 

• It gives some assurance of the identity of the running application beyond just a module name 
matching. 

• It provides a basis for determining if the in-memory image of the application is authentic (a 
basis for dynamic module authentication). 

Standard static module authentication 

[0064] For standard static authentication, the PE (portable executable) file on the persistent 
storage device (which is usually a disk, but can also be flash memory on a portable device) is 
hashed and checked against a signed hash value. To compute the file hash, the PE headers of the 
file must be parsed in order to avoid hashing data that can legitimately change after the file is 
created. Examples are timestamps and global data sections. The algorithm for hashing the PE file 
is the same one used by Windows for signing system binaries. 

Dynamic module authentication 

[0065] Dynamic module authentication is the authentication of the in-memory images of 
modules loaded in the process. While static module authentication only gives an assurance that the 
executable file on disk hasn't been tampered, dynamic module authentication gives an assurance 
that code pages in memory haven't been tampered. It is the mechanism by which the RM system 
protects the application's execution environment against attacks such as code redirection via import 
tables or inserted jump instructions and even software breakpoints (int 3's). Since code pages can be 
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tampered anytime during the life the process, dynamic module authentication is an ongoing task. 
Dynamic module authentication can be expensive and has to be managed carefully. 

[0066] The setup of dynamic authentication involves computing the expected in-memory 
page hashes of all read-only module pages. The page hashes are stored in tables for quick reference 
in subsequent page verification. The basis for establishing what the code pages should look like in 
memory is a signed disk image of the PE file itself or its corresponding prehash. Determining the 
expected page hash is complicated by the existence of relocations and import address tables. 
Authenticating relocated pages and import tables forces us to apply the logic used by the Windows 
loader. For example, if the module being authenticated was rebased by the OS, we have to apply 
relocation fix-ups to the appropriate addresses before computing each page hash. If the module has 
static imports or delay imports, we have to mimic the loader when computing the expected import 
address table (IAT). 

[0067] The verification phase involves computing hashes of actual memory pages for 
comparison with expected hash values stored during dynamic module authentication setup. Two 
types of verification are enabled. First, arbitrarily-sized portions of the authenticated modules can 
be checked at run-time for verification. This checking can be performed on a random sample of 
pages to increase the likelihood of detecting tampering in a large application. The second type of 
validation enabled is that of run-time call stack validation. If it is assumed that the address of the 
instructions in the application that call into the RM system can be reliably determined, dynamic 
module authentication enables targeted verification of the application. The address of the caller can 
be validated to come from within an "allowed" module. This prevents calls into the RM system 
directly from untrusted code. In addition, the memory pages surrounding and containing the call 
instruction to the RM system can be authenticated to protect against tampering. This check prevents 
an adversary from using a tampered "allowed" caller. 

IAT Checking 

[0068] A simple and common way to redirect code is to modify a module's import address 
table (IAT). We attempt to prevent this obvious attack as part of dynamic module authentication. 
We protect both static IATs and delay load IATs by independently constructing ("binding") the 
IATs and comparing to the actual IAT in use. 
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[0069] It is noted that the foregoing examples have been provided merely for the purpose 
of explanation and are in no way to be construed as limiting of the present invention. While the 
invention has been described with reference to various embodiments, it is understood that the words 
which have been used herein are words of description and illustration, rather than words of 
limitations. Further, although the invention has been described herein with reference to particular 
means, materials and embodiments, the invention is not intended to be limited to the particulars 
disclosed herein; rather, the invention extends to all functionally equivalent structures, methods and 
uses, such as are within the scope of the appended claims. Those skilled in the art, having the 
benefit of the teachings of this specification, may effect numerous modifications thereto and 
changes may be made without departing from the scope and spirit of the invention in its aspects. 
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